-
Notifications
You must be signed in to change notification settings - Fork 183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deduplicate tz locations against root #5759
Conversation
5e0fbb2
to
5209eea
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I had mixed feelings about this, since mixing data from multiple CLDR versions could cause problems. But, I think the benefits outweigh the drawbacks. See #5763.
locations_root: | ||
<R::ZoneLocations as DateTimeNamesHolderTrait<tz::LocationsV1Marker>>::Container<()>, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thought: This makes DateTimeFormatter<YMDHMSZ>
a little bit bigger in stack size.
time_zone/locations@1, <total>, 1979623B, 1951879B, 204 unique payloads | ||
time_zone/locations@1, af, 8161B, 8025B, 1cb532d1ca53b263 | ||
time_zone/locations@1, ak, 8179B, 8043B, 61a5ec381137eb44 | ||
time_zone/locations@1, <total>, 1435127B, 1407337B, 204 unique payloads |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Praise: 1'979'623B to 1'435'127B is a big enough improvement that I'm happy landing this despite my other comments.
A lot of location names are the same in many languages.
For exemplar cities we have root data, however for territories we don't. For territories I'm resolving the
und
name by using a Latin-script endonym (i.e. for regionXY
, resolveund-Latn-XY
), and if that fails, just use English (not perfect, but it's better than the raw region code for both display purposes and deduplication).